ROSA: R Optimizations with Static Analysis

نویسندگان

  • Rathijit Sen
  • Jianqiao Zhu
  • Jignesh M. Patel
  • Somesh Jha
چکیده

R is a popular language and programming environment for data scientists. It is increasingly co-packaged with both relational and Hadoop-based data platforms and can often be the most dominant computational component in data analytics pipelines. Recent work has highlighted inefficiencies in executing R programs, both in terms of execution time and memory requirements, which in practice limit the size of data that can be analyzed by R. This paper presents ROSA, a static analysis framework to improve the performance and space efficiency of R programs. ROSA analyzes input programs to determine program properties such as reaching definitions, live variables, aliased variables, and types of variables. These inferred properties enable program transformations such as C++ code translation, strength reduction, vectorization, code motion, in addition to interpretive optimizations such as avoiding redundant object copies and performing in-place evaluations. An empirical evaluation shows substantial reductions by ROSA in execution time and memory consumption over both CRAN R and Microsoft R Open.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Static Analysis of Dynamic Schedules and Its Application to Optimization of Parallel Programs

Effective optimizations for concurrent programs require the compiler to have detailed knowledge about the scheduling of parallel tasks at runtime. Currently, optimizations for parallel programs must define their own models and analyses of the parallel constructs used in the source programs. This makes developing new optimizations more difficult and complicates their integration into a single op...

متن کامل

The Limits of Alias Analysis for Scalar Optimizations

In theory, increasing alias analysis precision should improve compiler optimizations on C programs. This paper compares alias analysis algorithms on scalar optimizations, including an analysis that assumes no aliases, to establish a very loose upper bound on optimization opportunities. We then measure optimization opportunities on thirty-six C programs. In practice, the optimizations are rarely...

متن کامل

Energy Absorption Analysis and Multi-objective Optimization of Tri-layer Cups Subjected to Quasi-static Axial Compressive Loading

In this paper, the energy absorption features of tri-layer explosive-welded deep-drawn cups subjected to quasi-static axial compressive loading are investigated numerically and experimentally. To produce the cups, tri-layer blanks composed of aluminum and stainless steel alloys were fabricated by an explosive-welding process and formed by a deep drawing setup. The quasi-static tests were carrie...

متن کامل

Taxonomical study of section Caninae )Rosa) and their hybrids in Iran

The genus Rosa has a wide variety in Iran and although some natural hybrids have been found in different regions of the country, but no cultivars have been established so far. In order to optimize use of genetic resources in the country and to create varieties with desirable traits, inter-species relationships and identification of diversity, 45 populations from seven species in sectio...

متن کامل

Optimizing Typed Feature Structure Grammar Parsing through Non-Statistical Indexing

This paper introduces an indexing method based on static analysis of grammar rules and type signatures for typed feature structure grammars (TFSGs). The static analysis tries to predict at compile-time which feature paths will cause unification failure during parsing at run-time. To support the static analysis, we introduce a new classification of the instances of variables used in TFSGs, based...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1704.02996  شماره 

صفحات  -

تاریخ انتشار 2017